Comparative Analysis of Text Vectorization Methods

Authors

Abstract

The paper considers methods for vectorizing the textual properties of natural language in the context of the task of intellectual text analysis. The most common methods of statistical analysis and feature extraction, and methods that take context into account, are analyzed. The work describes the above types of embeddings, their variations, and their implementations. A comparative analysis was performed, which showed a relationship between the type of task and the method showing the best metrics. The topology of the neural network that serves as the basis for solving the problem and obtaining the metrics is described and implemented. The comparative analysis was carried out using relative time from the theory of algorithms and the classification metrics accuracy, F1-score, precision, and recall. The classification metrics were taken from the results of building a neural network model with the described framing methods. As a result, in the task of analyzing the tonality of text, embeddings based on n-grams of character sequences turned out to be the best.
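
The abstract names character n-gram features and four classification metrics without showing either. As a rough illustration only, and not the paper's implementation, the Python sketch below builds count vectors over character trigrams and computes accuracy, precision, recall, and F1 for binary labels. The function names, the trigram size, the toy documents, and the toy labels are all assumptions made for this example; the paper's neural network and its actual vectorizers are not reproduced here.

```python
from collections import Counter


def char_ngrams(text, n=3):
    """Return the overlapping character n-grams of `text`, in order."""
    return [text[i:i + n] for i in range(len(text) - n + 1)]


def vectorize(text, vocabulary, n=3):
    """Map `text` to a count vector over a fixed n-gram vocabulary."""
    counts = Counter(char_ngrams(text, n))
    return [counts[gram] for gram in vocabulary]


def binary_metrics(y_true, y_pred):
    """Compute accuracy, precision, recall, and F1 for labels in {0, 1}."""
    pairs = list(zip(y_true, y_pred))
    tp = sum(1 for t, p in pairs if t == 1 and p == 1)
    fp = sum(1 for t, p in pairs if t == 0 and p == 1)
    fn = sum(1 for t, p in pairs if t == 1 and p == 0)
    tn = sum(1 for t, p in pairs if t == 0 and p == 0)
    accuracy = (tp + tn) / len(pairs) if pairs else 0.0
    # Guard against empty denominators when a class is never predicted.
    precision = tp / (tp + fp) if (tp + fp) else 0.0
    recall = tp / (tp + fn) if (tp + fn) else 0.0
    denom = precision + recall
    f1 = 2 * precision * recall / denom if denom else 0.0
    return {"accuracy": accuracy, "precision": precision,
            "recall": recall, "f1": f1}


# Toy data (hypothetical): two short documents and four predictions.
docs = ["good film", "bad film"]
vocabulary = sorted({gram for doc in docs for gram in char_ngrams(doc)})
vectors = [vectorize(doc, vocabulary) for doc in docs]
metrics = binary_metrics([1, 0, 1, 1], [1, 0, 0, 1])
```

Character n-grams need no tokenizer and tolerate misspellings, since two spellings of a word still share most of their n-grams; the resulting vectors could then feed any classifier whose predictions are scored with `binary_metrics`.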

Similar resources

A Comparative Analysis of Supervised Multi-label Text Classification Methods

Multi-label classification methods are becoming more popular nowadays because of increasing demand for them in application domains such as text classification, image classification, functional genomics, music categorization, and emotion recognition. Multi-label classification methods fall into two broad categories: problem transformation methods and algorithm adaptation me...

Sentiment analysis methods in Persian text: A survey

With the explosive growth of social media such as Twitter, reviews on e-commerce websites, and comments on news websites, individuals and organizations are increasingly using the opinions expressed in these media for their decision making. Sentiment analysis is one of the techniques used in recent years to analyze users' opinions. The Persian language has specific features and thereby requires unique meth...

A comparative pragmatic analysis of the speech act of "disagreement" across English and Persian

The speech act of disagreement has been one of the speech acts that has received the least attention in the field of pragmatics. This study investigates the ways power relations, social distance, formality of the context, gender, and language proficiency (for EFL learners) influence disagreement and politeness strategies. The participants of the study were 200 male and female native Persian s...

Comparative DNA interaction studies of the antiviral drug zidovudine and its complex using different instrumental methods

The aim of this study is to examine whether drugs already known for treating other diseases can be used as anticancer drugs. In addition, by incorporating these drugs into the structure of a metal complex, the resulting pharmacological indices can be examined. The anti-HIV drug zidovudine (AZT) was selected, and the water-soluble complex [Pt(AZT)2]Cl2 was synthesized and characterized by various physical and chemical methods. The comparative interaction of this drug and the complex ...

Journal

Journal title: Elektronìka ta sistemi upravlìnnâ

Year: 2023

ISSN: 1990-5548

DOI: https://doi.org/10.18372/1990-5548.76.17663